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A SYSTEM FOR OPTIMAL RESOURCE ALLOCATION 
AND PLANNING FOR HOSTING COMPUTING SERVICES 

Field of Invention 

The invention relates generally to hosting services in a networked environment 
and in particular to the planning and allocation of resources in such networked 
environments. 

Background 

Companies that go online require (i) a multitude of application services and 
(ii) high end servers to run these applications. Examples of relevant application services 
include Hyper Text Transfer Protocol (HTTP), Domain Name System (DNS), File 
Transfer Protocol (FTP), proxy, firewalls and business applications. To achieve this, such 
companies face considerable overheads in installing and updating the applications and 
maintaining the servers. This drives the emergence of a new business model, namely, 
hosting applications over the web. 

The demand for this business model is growing at a rapid pace, because of 
significant opportunities in hosting e-commerce and e-business applications for Small and 
Medium Businesses (SMBs). SMB managers often seek to outsource application hosting 
to service providers to avoid the capital expenses incurred by computing resources, 
software and maintenance. In this respect, the role of service providers is significant. 



Hosting services cover a wide range of capabilities in three broad categories: 

(i) Collaborative hosting services include email, internal and external 
webhosting, collaborative applications and scheduling applications; 

(ii) Commerce hosting services include business applications that enable 
online Business to Business (B2B) trading and Business to Consumer 
(B2C) e-commerce hosting; and 

(iii) Other e-business hosting services include custom-built and Enterprise 
Resource Planning applications. 

Examples of such hosting services can be found at the following URLS: 
Netscape Hosting Services White Paper, at 
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http://home.netscape.eom/hosting/v4.0/whitepaper/; "Building a Scalable and Profitable 
Application Hosting Business", White Paper, Ensim Corporation, at 
http://www.ensim.com/ProductsAindex.htm; IBM : developerWorks : Web architecture 
library, "IBM Application Framework for e-business: Application hosting services", 
5 Stewart Nickolas, IBM, October 1999, at 

http://www-4.ibm.com/software/developer/library/ahs/; and "Hosting with Exchange 
2000 Server", White Paper, at http://www.microsoft.com/exchange/techinfo/ASP 
planning.htm. 

10 These hosting services are especially relevant for SMBs, which can access highly 

functional applications without having to incur a large capital expenditure while 
maintaining the focus on core competencies. In the absence of such hosting services, the 
SMB segment has to afford dedicated services and knowledge about the resource 
requirements such as a central processing unit (CPU), Random Access Memory (RAM) 

1 5 and business applications for running their e-businesses, which can clearly not be a 
cost-effective solution. 

From the service provider's perspective, hosting quality services for the SMB 
necessitates providing flexible, cost-effective services within a Quality of Service (QoS) 
20 guarantee. Planning and optimizing the resource requirements for hosting such services, 
based on the requirements of its clients, is an important objective for a service provider. 

Different approaches for hosting services are described briefly at the following 
URL: Building a Scalable and Profitable Application Hosting Business 55 , White Paper, 
25 Ensim Corporation, at http://www.ensim.com/Products/index.htm. These different 
approaches are described briefly hereinafter. 

Dedicated Services 

The dedicated services model dictates that a client be given an integral number of 
30 servers by reserving these exclusively for the client. However, the use of this approach in 
a SMB context, where a shared server may suffice, may be too expensive. Also in a large 
enterprise context, using this approach may be prohibitive, since huge reservations may be 
needed to ensure service during limited peak hours. 
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Shared Services 

The shared services model is a popular method used by service providers, 
especially for virtual web hosting and applications designed specifically to support 
multiple customers on a single instance of an application. In this approach, multiple 
5 customers and services are hosted on a single server. However, in a shared environment, 
QoS is difficult to guarantee to clients. A surge in traffic of one client may adversely 
affect a co-located client. Reliability and security are other issues, which are difficult to 
address in this approach. 

10 Virtual Servers 

The concept of virtual servers is relatively new, but appears to be on the verge of 
becoming mainstream. Virtual Server technology involves creating multiple logical 
servers on a single system, enabling individual users to have equal access to computer 
resources, such as RAM and CPU. The technology enables service providers to use 

1 5 resources more efficiently. Since virtual servers behave like independent physical 

machines, the virtual services can run more complicated applications such as databases, 
security and telnet on a shared server. However, while helping in providing fractional 
solutions, the virtual servers still behave like small independent servers with limited 
resources. Thus, to guarantee QoS, the service provider must allocate resources for 

20 handling the peak loads of its clients, which can be much larger than the average load on 
the system. 

In a conventional system, a client desiring a good QoS must provide for some 
percentile of the peak load observed. The service provider on the other hand must 
25 therefore reserve resources for each client to service the client at peak load. This 

disadvantageous^ results in reservation of resources that lie mostly idle in a non-peak 
load condition. 

Thus, a need clearly exists for a system of planning and allocating 
30 hosting-service resources in an optimal manner so as to overcome or at least ameliorate 
one or more of the disadvantages of conventional approaches to providing hosting 
services. 
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Summary 

In accordance with a first aspect of the invention, a method for allocating 
hosting-service resources to clients in at least one shared server is disclosed. The method 
includes the steps of: 
5 discovering utilization patterns of the clients; and 

allocating the resources to the clients dependent on the utilization patterns. 

Preferably, the method further includes the step of providing bounds specifying 
minimum and maximum hosting-service resources for each of clients, allocating step also 
1 0 being dependent upon bounds. 

Preferably, the method further includes the step of modeling dimensions for 
client use measures and utilization patterns. The method may further include the step of 
packing clients using stochastic vector packing. Further, the packing step may utilize at 
1 5 least one of the processes selected from the group consisting of a Roof Avoidance 
process, a Minimized Variance process, a Maximized Minima process, and a Largest 
Combination process. 

Preferably, the hosting-service resources relate to at least one hosting service 
20 selected from the group consisting of collaborative hosting services, commerce hosting 
services, and e-business hosting services. 

Preferably, the allocating step effects a Quality of Service (QoS) guarantee. 

25 Preferably, the utilization patterns are dependent upon access rates of one or 

more websites, the access rates have periodicity on multiple time scales. Two or more 
clients may be selected from several clients on the basis of complementarity, wherein the 
hosting-service resources are allocated to the selected two or more clients as a 
combination. Further, the allocating step may include the step of selecting said two or 

30 more clients to be allocated to a server. The two or more selected clients may each have a 
peak load that is substantially disjoint in time in relation to a peak load of the remaining 
other selected clients. Further, the allocated host-service resources may include resources 
allocated exclusively to each of the selected two or more clients and shared resources 
allocated to the combination for use by the selected two or more clients. Still further, N 
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clients may be selected and allocated to a server, with N being an integer greater than or 
equal to two. The server may be partitioned into N virtual servers, each client being 
exclusively allocated a corresponding one of the N virtual servers. Excess capacity of the 
server beyond the capacity required to provide the N virtual servers is shared by the N 
5 clients. 

Preferably, the method further includes the step of monitoring the clients to 
discover the utilization patterns. 

10 In accordance with a second aspect of the invention, an apparatus for allocating 

host-service resources to clients in at least one shared server is disclosed. The apparatus 
includes: a device for discovering utilization patterns of the clients; and a device for 
allocating the resources to the clients dependent on the utilization patterns. 

15 in accordance with a third aspect of the invention, a computer program product 

having a computer readable medium having a computer program recorded therein for 
allocating host-service resources to clients in at least one shared server is disclosed. The 
computer program product includes: a computer program code module for discovering 
utilization patterns of the clients; and a computer program code module for allocating the 

20 resources to the clients dependent on the utilization patterns. 

In accordance with a fourth aspect of the invention, a decision support system for 
allocating and planning resources in hosting computing services is disclosed. The 
decision support system includes: 
25 a device for modeling utilization of resources of one or more servers by clients in 

response to at least one of utilization patterns of the clients and specified rules regarding 

quality of service; and 

a device for determining a minimum number of servers for accomodating the 
clients to ensure a specified minimum quality of service. 



30 



Preferably, the determining device utilizes stochastic vector packing. The system 
facilitates optimal management of resources in the hosting computing services. 

Preferably, the hosting computing services include hosting computing resources, 
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computing applications, computing-related services, and network bandwidth. 

Preferably, the device for generating for a service provider a set of suggestions 
for optimal resource planning and allocation. 

5 

Preferably, the system provides an optimization service for use in a business 
model hosting optimization applications. 

In accordance with a fifth aspect of the invention, a decision support method for 
10 allocating and planning resources in hosting computing services is disclosed. The method 
includes the steps of: modeling utilization of resources of one or more servers by clients 
in response to at least one of utilization patterns of the clients and specified rules 
regarding quality of service; and determining a minimum number of servers for 
accomodating the clients to ensure a specified minimum quality of service. 

15 

In accordance with a sixth aspect of the invention, a computer program product 
having a computer readable medium having a computer program recorded therein for 
providing decision support to allocate and plan resources in hosting computing services. 
The computer program product includes: a computer program code module for modeling 
20 utilization of resources of one or more servers by clients in response to at least one of 
utilization patterns of the clients and specified rules regarding quality of service; and a 
computer program code module for determining a minimum number of servers for 
accomodating the clients to ensure a specified minimum quality of service. 

25 Brief Description of the Drawings 

A small number of embodiments are described hereinafter with reference to the 

drawings, in which: 

Fig. 1 is a block diagram of a system for optimally allocating and planning 
30 resources for hosting computing services in accordance with a preferred embodiment of 
the invention; 

Fig. 2 is a detailed block diagram of a decision support system included in the 
system of Fig. 1; 
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Fig. 3 is a flowchart of a method for optimally allocating and planning resources 
for hosting computing services in accordance with the preferred embodiment of the 
invention; and 

Fig. 4 is a block diagram of a general purpose computer, with which 
embodiments of the invention may be practiced. 



Detailed Description 

10 A method, an apparatus, and a computer program product are disclosed for 

optimally allocating and/or planning resources for hosting computing services (or simply 
hosting services). In the following description, numerous details are set forth including 
particular hosting services such as email, webhosting, and the like. It will be apparent to 
one skilled in the art, however, that the present invention may be practised without these 

1 5 specific details. In other instances, well-known features are not described in detail so as 
not to obscure the present invention. 

In the following description, components of the hosting-service resource 
allocation and/or planning system are described as modules. A module, and in particular 

20 its functionality, can be implemented in either hardware or software. In the software 
sense, a module is a process, program, or portion thereof, that usually performs a 
particular function or related functions. In the hardware sense, a module is a functional 
hardware unit designed for use with other components or modules. For example, a 
module may be implemented using discrete electronic components, or it can form a 

25 portion of an entire electronic circuit such as an Application Specific Integrated Circuit 
(ASIC). Numerous other possibilities exist. Those skilled in the art will appreciate that 
the system can also be implemented as a combination of hardware and software modules. 

The preferred embodiment provides a system for optimally allocating and 
30 planning hosting service resources, and more particularly a decision support system, that 
assists a service provider in planning and allocating computing resources, computing 
applications and other computing services. In this embodiment, stochastic modeling and 
methods for resource allocation and planning, which meet customer needs and provide a 
Quality of Service (QoS) based on utilization patterns, are also disclosed. The 
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embodiments of the invention provide a decision support system that assists service 
providers in providing hosting services in a manner that satisfies customer requirements 
and QoS. 

5 Overview 

The embodiments of the invention advantageously provide a decision support 
system for optimal resource planning in web farms that host computing services, based on 
the requirements of the clients. The resources are allocated to the clients based on 
suggestions given by the decision support system to ensure better QoS guarantees to the 
1 0 clients, while requiring fewer resources. 

The resource utilization of various web-sites or web applications is assumed to 
be closely correlated to the access rates of the same. Access rates of web-sites are 
observed to have periodism on multiple time scales in nature. In the embodiments of the 

15 invention, these patterns are used to arrive at suitable combinations of clients, such that a 
set of clients in the same combination are co-hosted on a shared server. As far as 
possible, the combinations are chosen so that the clients in the same combination 
experience a peak load in disjoint time periods. For example, one client having a weather 
forecast application may be accessed mainly in the morning, while another client requires 

20 stock quotes that may be accessed in the evening. Also, for example, sites accessed from 
the United States and India are more heavily loaded at alternate times in the day, 
considering the difference in time zones. 

For the purpose of choosing such client combinations, the concept of 
25 complementarity is defined, which is used to determine clients that have peak periods in 
almost disjoint intervals. 

Unlike conventional systems, the embodiments of the invention provide a 
method in which the provider does not reserve resources exclusively for each client to 
30 service the client's peak load, but instead reserves resources for a combination of clients 
that can share the resources as per their time-dependent requirements. 

The combinations of clients that are co-hosted on a shared server are chosen such 
that the clients in the same combination experience peak loads at different times. The 
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service provider may also reserve a certain amount of resources exclusively for each client 
in a combination. Thus, a client has access to two types of resources, one that has been 
exclusively reserved for the client as well as the shared resource reserved for the 
combination of clients, of which the client is a part. 

5 

Providing resources for the clients in such combinations has a significant 
advantage. Each client can be guaranteed a minimum amount of resources and also 
guaranteed more than the minimum with a certain probability. Since some resources are 
reserved exclusively for each client, a peak load of a co-hosted client does not adversely 
1 0 affect other clients hosted on the same shared server oh the average. This method 
provides better QoS for the same number of clients and resources. In addition, the 
method according to the embodiments requires fewer resources to provide the same QoS 
for the same number of clients with the same resource utilization rates. 

15 Client Workload Characterization 

The client resource requirement is modeled as a stationary stochastic process. Web 
access rates are typically distributed differently during the day. This fact is captured by 
dividing the day into k time slots (e.g., k=24 and each time slot is an hour long) and 
modeling the resource requirements of each client as a different stationary stochastic 

20 process in different time slots and for different resources. For r resources and k time 
slots, the resource requirement of each client is modeled as a random vector having 
d = r x k dimensions. Thus each dimension of the requirement vector represents a 
particular resource in a specific time interval. 

25 The resource requirement process is assumed to have the same distribution each 

day. Although the model is described for a 24-hour period, another period that is better 
suited to the data may be chosen. Let the random variable XJij} denote the requirement 
of client i at any time t in the time interval corresponding to dimension j (due to the 
stationarity assumption, this random variable has a distribution that is independent oft). 



30 



As a performance guarantee each client i negotiates an agreement whereby the 
client is always allocated a minimum resource requirement a_{ij}(at least 0), and can 
specify a maximum requirement of b_{ij} (less than or equal to the capacity of a server) 
for all j = l,...,d. Notice that the minimum and maximum requirements are allowed to 
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vary over time and resources. 

Let Y_{ij} denote the capacity promised to client i in dimension j. Then from a 

* 

probability distribution viewpoint: 

YJij } = a_{ij } if X_{ij } < a_{ij } 

= X_{ij } if a_{ij } <= XJij } <= b_{ij } 
= b_{ij} otherwise. 

I o The past access pattern of client i is used to estimate the distribution of X_{ij } 

and hence of Y_{ij} for j=l,2,...,d. 

Alpha-satisfiability is introduced as a measure of QoS. A client is said to be 
alpha-satisfied if the client receives a promised capacity at least alpha proportion of time 
15 in each dimension. 

Let clients C_l, C_2,...,C_n be allocated to a server and let Q = (Q_l, ...,Q_d) 
denote the server capacity in dimensions l,...,d. This allocation is alpha-satisfiable if for 
every j = l,...,d, the probability, 

P( Y_{ lj } + Y_{2j} + ... + Y_{nj} >= QJ) <= 1 - alpha, 
where alpha is a scalar constant close to 1 . This is a much stronger guarantee than the one 
that ensures that during a period of 24 hours a client is given promised capacity alpha 
proportion of time. In that case client may not be given the desired capacity during his 
peak requirement period. 



20 



25 



30 



Resource Allocation Problem 

Given N clients, the minimum number of servers needed to host the clients is 
sought, subject to alpha-satisfiability as defined above. Each client resides on one of the 
servers, sharing the server with the other clients on the server. 

This problem^can be solved through a vector-packing approach. Vector-packing 
is a generalization of bin-packing to multiple dimensions. Bin-packing is an integer 
programming problem where items of various sizes are to be packed into bins of a given 
capacity, so as to minimize the number of bins. Multidimensional bin-packing packs 
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volumes into a multi-dimensional space. However, the generalization needed must ensure 
that the jobs (clients) do not overlap in any dimension. 



Servers are not required to have equal capacity Q. However, the capacity needs 
to be normalized by scaling different resources as follows. Let CTiJ be the capacity of 
the ith type of server in the jth dimension. Let CTmaxJ be the the maximum value of 
C^i J over all i types of servers. Then, the units in which each resource is measured are 
scaled so that CTmaxJ is the same for all dimensions j = l,...,d. 

A part of each server's resources is partitioned among the clients on the server, 
and the remaining resources are available as a common pool. Each server is modeled as a 
collection of resources, such as CPU, memory and bandwidth, with capacity constraints 
on each one. The ith client's partition has size corresponding to its minimum 

requirements a_{ij}, j=l d. The effect of this partition is to give each client protection 

from load surges from other clients on the same server. 

It is important to note here that, only those entities are considered resources, 
which are considered significant in resource planning. It is desirable to have fewer 
resources and time intervals, both to reduce computational complexity as well as to 
improve the resource utilization. Only a single abstract resource could model a system 
where there is a high correlation between the load on these resources, or if there is a single 
bottleneck resource. The choice of the set of resources is left to the provider. 

System and Method of Preferred Embodiment 

Fig. 1 is a block diagram of a system for optimally planning and/or allocating 
resources for hosting services in accordance with a preferred embodiment of the 
invention. The system 100 includes a monitoring system 1 12, a decision support system 
(DSS) 1 16, and a module for allocating resources to clients 122. The monitoring system 
1 12 is coupled to the Internet or web 1 10 and monitors client accesses for resources. The 
monitoring system 1 12 consequently produces a utilization pattern database 1 14, which is 
provided as input to the decision support system 1 16. Further, human 120 input can be 
used to produce rules (SLA) 118, which are also provided as input to the decision support 
system 116. The decision support system 1 16 provides suggestions to a module 122 for 
allocating resources optimally to clients. The system 100 obtains utilization/access 
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patterns from the utilization database 1 14 and preferably the contractual/service level 
agreement (SLA) from an SLA database (not shown). After obtaining the SLA 
rules/access data from relevant databases, the DSS 1 16 outputs suggestions for optimal 
resource planning and allocation to a human interface. A person may pick a suggestion 
from one of these outputs or look for an alternate set of suggestions by changing the SLA 
rules in the SLA database. 

Fig. 2 is a more detailed block diagram illustrating the decision support system. 
In the system 200 of Fig. 2, utilization patterns 210 and rules (SLA) 220 are provided as 
input to the decision support system 230. The decision support system 230 in turn 
includes a modeling module 232 and a stochastic vector-packing module 234. Human 
input 240 can be provided to both modules 232 and 234. The modeling module 232 
receives the inputs 210 and 220 and is in turn coupled to the vector module 234. The 
vector-packing module 234 in turn provides suggestions as output to a module for 
allocating resources to clients 250. Again, a person can intervene in deciding the model 
and the method on which the system may work to arrive at the set of suggestions. 

Fig. 3 is a flow diagram illustrating the method 3 00 of optimally planning and/ or 
allocating hosting-services resources in accordance with the preferred embodiment of the 
invention. The process 300 commences in step 3 10. In step 3 10, an initial set of clients C 
is provided. In step 3 12, a new server S is obtained. In decision block 3 14, a check is 
made to determine if there are any clients left remaining to be processed. If decision 
block 314 returns false (No), processing continues in step 316 and resource allocations are 
output to the clients. Otherwise, if decision block 314 returns true (Yes), processing 
continues at step 318. 

In decision block 318, alpha-satisfiable clients are found for the new server S 
from the set of clients C. In decision block 320, a check is made to determine if any such 
clients were found in step 318. If decision block 320 returns false (No), processing 
continues at step 312. Otherwise, if decision block 320 returns true (Yes), processing 
continues at step 322. 

In step 322, the most complementary client c' is found. In step 324, the client c' 
is allocated to the new server S and removed from the set of clients C. Processing then 
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continues at decision block 314. 
Packing Heuristic 

Clients C_1,...,C_N are to be packed into servers. During the progress of the 
process 300 clients C_n,..., C_N are left to be packed, n <= N, where N is the total 
number of clients. The ith client is represented by a resource usage vector 
y j = (Y {il},. ,Y_{id}), where each element is a random variable. 

Servers are chosen to be packed in a sorted sequence. For example, the servers 
could be in decreasing order of mean capacity. If there is a set of dimensions of higher 
priority, the servers may be sorted in mean capacity over only these dimensions, e.g., one 
may want to prioritize the dimension that corresponds to a peak time period and an 
expensive resource. 

At a given time, only one server is open for packing, clients being added to the 
server one-by-one. Let Q = (Q_{1 },.,Q_{d}) be the capacity of this server. At the time 
a client is added, either the server is empty, or there are existing clients in the server. Let 
CJB be the set of clients already in the current server. Let B = (B_l,...,B_d) denote the 
distribution of the resource utilized in the current server due to the clients already in the 
server. Let E(B)=(E(B_1 ),. . ., E(B_d)) be the vector of expected values of the resource 
utilized in the dimensions of the current server. 

If the ith client is chosen as a suitable addition to the current server, the server's 
resource utilization vector is updated to the element-by-element convolution of Y_i with 
B. The new server distribution B_new = (B_l*Y_{il },..., B_d*Y_{id» = B*Y_i, where 
the * symbol represents the convolution operation. Let C_alpha denote the subset of 
clients {C_n,..., C_N} that are alpha-satisfiable with the current server. 

The idea of complementarity is used to allocate a client to the current server. 
The following heuristics may be used for finding the most complementary client for the 
current server: 

Roof Avoidance 

Let E(Y {il}), . ,E(Y_{id}) denote expected values resource requirement of 
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Y_{il},...,Y{id}, for the ith client. 

Let D be the dimension where the current server has the highest mean 
requirement, i.e., E(B_D) is the maximum among E(B_l),...,E(B_d). The mth client is 
5 said to be the most complementary client if, E(Y_{mD}) is the minimum among all those 
E(Y_{iD}) such that the ith client C_i is in the set C_alpha (i.e., C_m is the client that has 
the smallest mean in the dimension D, and that is also alpha-satisfiable). 



Minimized Variance 

1 o Let the empty space in the server be denoted by e = (e_l , ,e_d), where e J i s the 

maximum amount of resource that may be taken away from this dimension so that the 
current server stays alpha-satisfiable. Denote the empty space in the current server if the 
ith client was added to the server by e'H =(e_l A i,...,e_d A i). The variance of e*i is called 
the empty space variance of the ith client. 



15 



20 



Now the mth client is the most complementary client if the mth client gives rise 
to the least empty space variance among all the clients in C_alpha. This variation 
attempts to have equalize the utilization over all dimensions, thereby preventing 
saturation in any one dimension. 



Maximized Minima 

Processing begins by finding C_m as in the Minimized Variance heuristic. Let 
C_v be a set of clients (subset of C_alpha) with empty space variance close to that of 
C_m, i .e., if C_i is in C_v, then the difference between the empty space variance of C_i 
25 and C_m is less than a scalar constant V. 

Define the maximum free space fi for the ith client in C_v as the maximum 
ej *i over all dimensions j . The most complementary client is now updated to C_m, such 
that fm the minimum among all fi, for all clients in C_v. The motivation here is to 
30 keep the variance low with also a low maximum empty space. 

Largest Combination 

The set of clients C_v, which have a low empty space variance, is found as in 
Maximized Minima. The mean of e*i is called the mean empty space. The most 
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complementary client is C_m has the least mean empty space among all the clients in 
C_v. The motivation here is to keep the variance low with also a high overall usage of 
the server. 

With the above procedures of finding the most complementary client, the 
packing proceeds as set forth in Table 1 : 

TABLE 1 

Let C be the set of clients, { C_1,...,C_N } 
While (C not empty) 

Pick a new server B 

The set of clients C_B is empty 

Let C_alpha be the alpha-satisfiable client set for server B 
While (C_alpha not empty) 

C_m is found as the most-complementary-client from C_alpha 

Update the server probability distribution vector B 

Add C_m to C_B 

Remove C_m from C 

Update C_alpha from the new C for the new distribution 

End 
close B 

End 



The process of Table 1 finds its inspiration from one dimensional bin-packing 
together with the use of variance amongst the dimensions. The one-dimension 
bin-packing problem has been well studied. In that context off-line algorithms such as 
First-Fit Decreasing (FFD) and Best-Fit Decreasing (BFD) have been found effective with 
a worst case approximation bound of 1 1/9. Our first approach is in the spirit of Best-Fit 
Decreasing. However the emphasis of the multi-dimensional processes defined above is 
to reduce variance and thus avoid over-utilization of any one single dimension. The 
one-dimensional approach that attempts to bucket items into size classes is termed as 
Modified First-Fit Decreasing and is reported to improve on FFD. Our Maximized 
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Minima approach performs a bucketing also but the criterion of the bucketing is the 
variance. 

Implementation Issues 

5 For a new client, resource utilization history may not be available. Unrestricted 

resources are allowed to such a client, initially. This allows characterization of the client's 
probability distribution. After the initial period, the distributions are continually updated 
for re-optimization of the layout. A new client is added to the most complementary server 
that is alpha-satisfiable with the addition of the client. If no such server can be found, 

1 0 then a new server is opened and add the client to this new server. 

To support a change in customers requirements, change in utilization patterns, or 
deletion of clients from the farm, the client is removed from the server and the resources 
allocated to that server are freed. Then the client with new specifications and frequency 
1 5 distribution tables is added to the most complementary server, afresh. After a number of 
such additions, deletions, and reallocations the server farm may no longer have a close to 
optimal resource plan. It is thus desirable to have periodic maintenance wherein the entire 
process of allocation is repeated afresh for all currently hosted clients. 

20 By treating larger sets of servers as one server, or by splitting larger clients, the 

requirement that each client's requirements always be less than one single server can be 
overcome. If some clients need to be replicated across servers, the system treats each 
replica as an independent client. 

25 The process does not restrict the system from having high availability and fault 

tolerance. For high availability, each client may be split into two clients with the 
constraint that each half should be located on a separate machine. For replication, the 
system treats each client's replica as an independent client. The process simply includes a 
check for such clients so that such clients are not packed on the same server. To account 

30 for transient heavy demands, a safe capacity margin may be allowed in each server. 

The system supports priority classes of clients. Each priority class has an 
alpha-satisfiability associated with the class. Each server is packed for one such class. 
Once a higher priority group has been packed, the servers are rearranged in order of 
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decreasing empty space, and the process runs for the next lower priority group over all the 
servers. Since each server has a single alpha that the server is packed for, some lower 
priority clients may get packed into higher priority servers. Thus, resources are conserved 
while exceeding the QoS promised to certain clients. 

The embodiments of the invention are preferably implemented using a 
general-purpose computer. In particular, the processing or functionality of Figs. 1-3 can 
be implemented as software, or a computer program, executing on the computer. The 
method or process steps for optimally planning and/or allocating hosting-service resources 
are effected by instructions in the software that are carried out by the computer. The 
software may be implemented as one or more modules for implementing the process 
steps. A module is a part of a computer program that usually performs a particular 
function or related functions. Also, as described hereinbefore, a module can also be a 
packaged functional hardware unit for use with other components or modules. 

In particular, the software may be stored in a computer readable medium, 
including the storage devices described below. The software is preferably loaded into the 
computer from the computer readable medium and then carried out by the computer. A 
computer program product includes a computer readable medium having such software or 
a computer program recorded on it that can be carried out by a computer. The use of the 
computer program product in the computer preferably effects advantageous apparatuses 
for optimally planning and/or allocating hosting-services. 

Preferably, a computer system 400 shown in Fig. 4 includes the computer 450, a 
video display 410, and input devices 430, 432. In addition, the computer system 400 can 
have any of a number of other output devices including line printers, laser printers, 
plotters, and other reproduction devices connected to the computer 450. The computer 
system 400 can be connected to one or more other computers via a communication 
interface using an appropriate communication channel 440 such as a modem 
communications path, a computer network, or the like. The computer network 420 may 
include a local area network (LAN), a wide area network (WAN), an Intranet, and/or the 
Internet. 

The computer 400 itself preferably includes a central processing unit(s) 466 
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(simply referred to as a processor hereinafter), a memory 470 which may include random 
access memory (RAM) and read-only memory (ROM), input/output (IO) interfaces 464, 
472, a video interface 460, and one or more storage devices 462. The storage device(s) 
462 can include one or more of the following: a floppy disc, a hard disc drive, a 
magneto-optical disc drive, CD-ROM, magnetic tape or any other of a number of 
non-volatile storage devices well known to those skilled in the art. Each of the 
components is typically connected to one or more of the other devices via a bus 480 that 
in turn can consist of data, address, and control buses. 

The video interface 460 is connected to the video display 410 and provides video 
signals from the computer for display on the video display 410. User input to operate the 
computer can be provided by one or more input devices 430, 432. For example, an 
operator can use a keyboard 430 and/or a pointing device such as the mouse 432 to 
provide input to the computer. 

The foregoing system is simply provided for illustrative purposes and other 
configurations can be employed without departing from the scope and spirit of the 
invention. Computers with which the embodiment can be practiced include IBM-PC/ATs 
or compatibles, one of the Macintosh (TM) family of PCs, Sun Sparcstation (TM), a 
workstation or the like. The foregoing are merely examples of the types of computers 
with which the embodiments of the invention may be practiced. Typically, the processes 
of the embodiments, are resident as software or a program recorded on a hard disk drive 
as the computer readable medium, and read and controlled using the processor. 
Intermediate storage of the program and intermediate data and any data fetched from the 
network may be accomplished using the semiconductor memory, possibly in concert with 
the hard disk drive. 

In some instances, the program may be supplied to the user encoded on a 
CD-ROM or a floppy disk, or alternatively could be read by the user from the network via 
a modem device connected to the computer, for example. Still further, the software can 
also be loaded into the computer system from other computer readable medium including 
magnetic tape, a ROM or integrated circuit, a magneto-optical disk, a radio or infra-red 
transmission channel between the computer and another device, a computer readable card 
such as a PCMCIA card, and the Internet 420 and Intranets including email transmissions 
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and information recorded on websites and the like. The foregoing are merely examples of 
relevant computer readable mediums. Other computer readable mediums may be 
practiced without departing from the scope and spirit of the invention. 

In the foregoing manner, a method, an apparatus, and a computer program 
product for optimally planning and/or allocating hosung-service resources are disclosed. 
While only a small number of embodiments are described, it will be apparent to those 
skilled in the art in view of this disclosure that numerous changes and/or modifications 
can be made without departing from the scope and spirit of the invention. 
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